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<210> 1 

<211> 5454 

<212> DNA 

<213> Escherichia coli 



<220> 

<223> Expression plasmid pPA-GBP, for His6-protein A-GBP. Vector is 

PQE-80L (Qiagen) . CDS for Protein A (nucleotides 160-528) is from 
pEZZ18 (Amersham) . CDS for GBP (nucleotides 565-858) is from 
pSB3053 from (Brown, Nat. Biotechnol. 15:269-272, 1997). 



<220> 

<221> CDS 

<222> (115) . . (885) 

<223> CDS for fusion protein His6-protein A-GBP consisting of 257 amino 
acid residues. 

<400> 1 

ctcgagaaat cataaaaaat ttatttgctt tgtgagcgga taacaattat aatagattca 60 

attgtgagcg gataacaatt tcacacagaa ttcattaaag aggagaaatt aact atg 117 

Met 
1 

aga gga teg cat cac cat cac cat cac gga tec ggt tct ggt gcg caa 165 
Arg Gly Ser His His His His His His Gly Ser Gly Ser Gly Ala Gin 
5 10 15 

cac gat gaa gcc gta gac aac aaa ttc aac aaa gaa caa caa aac gcg 213 
His Asp Glu Ala Val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala 
20 25 30 

ttc tat gag ate tta cat tta cct aac tta aac gaa gaa caa cga aac 261 
Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn 
35 40 45 

gcc ttc ate caa agt tta aaa gat gac cca age caa age get aac ctt 309 
Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu 



I 



55 



65 



tta gca gaa get aaa aag eta aat gat get cag gcg ccg aaa gta gac 357 

Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Val Asp 
70 75 80 

aac aaa ttc aac aaa gaa caa caa aac gcg ttc tat gag ate tta cat 405 

Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu His 
85 90 95 

tta cct aac tta aac gaa gaa caa cga aac gee ttc ate caa agt tta 453 

Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu 
100 105 110 

aaa gat gac cca age caa age get aac ctt tta gca gaa get aaa aag 501 

Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys 
115 ~ 120 125 

eta aat gat get cag gcg ccg aaa gta gac gcg aat teg age tct ggt 549 

Leu Asn Asp Ala Gin Ala Pro Lys Val Asp Ala Asn Ser Ser Ser Gly 

130 135 140 145 

agt ggc aat ggt cat atg cat gga aaa act cag gca ace age ggg act 5 97 

Ser Gly Asn Gly His Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr 
150 155 160 

ate cag age atg cat gga aaa act cag gca ace age ggg act ate cag 64 5 

He Gin Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin 
165 170 175 

age atg cat gga aaa act cag gca acc age ggg act ate cag age atg 693 

Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met 
180 185 190 

cat gga aaa act cag gca acc age ggg act ate cag age atg cat gga 741 

His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly 
195 200 205 

aaa act cag gca acc age ggg act ate cag age atg cat gga aaa act 789 

Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys Thr 

210 215 220 225 

cag gca acc age ggg act ate cag age atg cat gga aaa att cag gca 837 

Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys He Gin Ala 
230 235 240 

acc age ggg act ate cag age atg cat get ctg tec etc gag ggt ccg 885 
■Thr Ser Gly Thr He Gin Ser Met His Ala Leu Ser Leu Glu Gly Pro 
245 250 255 

taataagctt aattagctga gcttggactc ctgttgatag atccagtaat gacctcagaa 945 

ctccatctgg atttgttcag aacgeteggt tgccgccggg cgttttttat tggtgagaat 1005 

ccaagctagc ttggcgagat tttcaggagc taaggaagct aaaatggaga a-aaaaatcac 1065 

tggatatacc acegttgata tatcccaatg geategtaaa gaacattttg aggcatttca 1125 



gtcagttgct 


caatgtacct 


ataaccagac 


cgttcagctg 


gatattacgg 


cctttttaaa 


1 1 QC 


gaccgtaaag 


aaaaataagc 


acaagtttta 


tccggccttt 


attcacattc 


ttgcccgcct 


124 5 


gatgaatgct 


catccggaat 


ttcgtatggc 


aatgaaagac 


ggtgagctgg 


tgatatggga 


i q n r 
1 JU5 


tagtgttcac 


ccttgttaca 


ccgttttcca 


tgagcaaact 


gaaacgtttt 


catcgctctg 


i i a c; 


gagtgaatac 


cacgacgatt 


tccggcagtt 


tctacacata 


tattcgcaag 


atgtggcgtg 


1 / QC 

14 Z 0 


ttacggtgaa 


aacctggcct 


atttccctaa 


agggtttatt 


gagaatatgt 


ttttcgtctc 


1485 


agccaatccc 


tgggtgagtt 


tcaccagttt 


tgatttaaac 


gtggccaata 


tggacaactt 


i c / r 

1545 


cttcgccccc 


gttttcacca 


tgggcaaata 


ttatacgcaa 


ggcgacaagg 


tgctgatgcc 


i a c\ c 
1605 


gctggcgatt 


caggttcatc 


atgccgtttg 


tgatggcttc 


catgtcggca 


gaatgcttaa 


1665 


tgaattacaa 


cagtactgcg 


atgagtggca 


gggcggggcg 


taattttttt 


aaggcagtta 


1725 


ttggtgccct 


taaacgcctg 


gggtaatgac 


tctctagctt 


gaggcatcaa 


ataaaacgaa 


1785 


aggctcagtc 


gaaagactgg 


gcctttcgtt 


ttatctgttg 


tttgtcggtg 


aacgctctcc 


T O A C 

1845 


tgagtaggac 


aaatccgccc 


tctagattac 


gtgcagtcga 


tgataagctg 


tcaaacatga 


1905 


gaattgtgcc 


taatgagtga 


gctaacttac 


attaattgcg 


ttgcgctcac 


tgcccgcttt 


1965 


ccagtcggga 


aacctgtcgt 


gccagctgca 


ttaatgaatc 


ggccaacgcg 


cggggagagg 


2025 


cggtttgcgt 


attgggcgcc 


agggtggttt 


ttcttttcac 


cagtgagacg 


ggcaacagct 


O A O C 

2085 


gattgccctt 


caccgcctgg 


ccctgagaga 


gttgcagcaa 


gcggtccacg 


ctggtttgcc 


2145 


ccagcaggcg 


aaaatcctgt 


ttgatggtgg 


ttaacggcgg 


gatataacat 


gagctgtctt 


2205 


cggtatcgtc 


gtatcccact 


accgagatat 


ccgcaccaac 


gcgcagcccg 


gactcggtaa 


2265 


tggcgcgcat 


tgcgcccagc 


gccatctgat 


cgttggcaac 


cagcatcgca 


gtgggaacga 




tgccctcatt 


cagcatttgc 


atggtttgtt 


gaaaaccgga 


catggcactc 


cagtcgcctt 


o o o c 
2 J o 5 n 


cccgttccgc 


tatcggctga 


atttgattgc 


gagtgagata 


tttatgccag 


ccagccagac 


O /I A C 

24 45 


gcagacgcgc 


cgagacagaa 


cttaatgggc 


ccgctaacag 


cgcgatttgc 


tggtgaccca 


2505 


atgcgaccag 


atgctccacg 


cccagtcgcg 


taccgtcttc 


atgggagaaa 


ataatactgt 


2565 


tgatgggtgt 


ctggtcagag 


acatcaagaa 


ataacgccgg 


aacattagtg 


caggcagctt 


2625 


ccacagcaat 


ggcatcctgg 


tcatccagcg 


gatagttaat 


gatcagccca 


ctgacgcgtt 


2685 


gcgcgagaag 


attgtgcacc 


gccgctttac 


aggcttcgac 


gccgcttcgt 


tctaccatcg 


2745 


acaccaccac 


gctggcaccc 


agttgatcgg 


cgcgagattt 


aatcgccgcg 


acaatttgcg 


2805 



3 



acggcgcgtg 


cagggccaga 


ctggaggtgg 


caacgccaat 


cagcaacgac 


tgtttgcccg 


2865 


ccagttgttg 


tgccacgcgg 


ttgggaatgt 


aattcagctc 


cgccatcgcc 


gcttccactt 


2925 


tttcccgcgt 


tttcgcagaa 


acgtggctgg 


cctggttcac 


cacgcgggaa 


acggtctgat 


2985 


aagagacacc 


ggcatactct 


gcgacatcgt 


ataacgttac 


tggtttcaca 


ttcaccaccc 


3045 


tgaattgact 


ctcttccggg 


cgctatcatg 


ccataccgcg 


aaaggttttg 


caccattcga 


3105 


tggtgtcgga 


atttcgggca 


gcgttgggtc 


ctggccacgg 


gtgcgcatga 


tctagagctg 


3165 


cctcgcgcgt 


ttcggtgatg 


acggtgaaaa 


cctctgacac 


atgcagctcc 


cggagacggt 


3225 


cacagcttgt 


ctgtaagcgg 


atgccgggag 


cagacaagcc 


cgtcagggcg 


cgtcagcggg 


3285 


tgttggcggg 


tgtcggggcg 


cagccatgac 


ccagtcacgt 


agcgatagcg 


gagtgtatac 


3345 


tggcttaact 


atgcggcatc 


agagcagatt 


gtactgagag 


tgcaccatat 


gcggtgtgaa 


3405 


ataccgcaca 


gatgcgtaag 


gagaaaatac 


cgcatcaggc 


gctcttccgc 


ttcctcgctc 


3465 


actgactcgc 


tgcgctcggt 


cgttcggctg 


cggcgagcgg 


tatcagctca 


ctcaaaggcg 


3525 


gtaatacggt 


tatccacaga 


atcaggggat 


aacgcaggaa 


agaacatgtg 


agcaaaaggc 


3585 


cagcaaaagg 


ccaggaaccg 


taaaaaggcc 


gcgttgctgg 


cgtttttcca 


taggctccgc 


3645 


ccccctgacg 


agcatcacaa 


aaatcgacgc 


tcaagtcaga 


ggtggcgaaa 


cccgacagga 


3705 


ctataaagat 


accaggcgtt 


tccccctgga 


agctccctcg 


tgcgctctcc 


tgttccgacc 


3765 


ctgccgctta 


ccggatacct 


gtccgccttt 


ctcccttcgg 


gaagcgtggc 


gctttctcat 


3825 


agctcacgct 


gtaggtatct 


cagttcggtg 


taggtcgttc 


gctccaagct 


gggctgtgtg 


3885 


cacgaacccc 


ccgttcagcc 


cgaccgctgc 


gccttatccg 


gtaactatcg 


tcttgagtcc 


3945 


aacccggtaa 


gacacgactt 


atcgccactg 


gcagcagcca 


ctggtaacag 


gattagcaga 


4005 


gcgaggtatg 


taggcggtgc 


tacagagttc 


ttgaagtggt 


ggcctaacta 


cggctacact 


4065 


agaaggacag 


tatttggtat 


ctgcgctctg 


ctgaagccag 


ttaccttcgg 


aaaaagagtt 


4125 


ggtagctctt 


gatccggcaa 


acaaaccacc 


gctggtagcg 


gtggtttttt 


tgtttgcaag 


4185 


cagcagatta 


cgcgcagaaa 


aaaaggatct 


caagaagatc 


ctttgatctt 


ttctacgggg 


4245 


tctgacgctc 


agtggaacga 


aaactcacgt 


taagggattt 


tggtcatgag 


attatcaaaa 


4305 


aggatcttca 


cctagatcct 


tttaaattaa 


aaatgaagtt 


ttaaatcaat 


ctaaagtata 


4365 


tatgagtaaa 


cttggtctga 


cagttaccaa 


tgcttaatca 


gtgaggcacc 


tatctcagcg 


4425 


atctgtctat 


ttcgttcatc 


catagttgcc 


tgactccccg 


tcgtgtagat 


aactacgata 


4485 


cgggagggct 


taccatctgg 


ccccagtgct 


gcaatgatac 


cgcgagaccc 


acgctcaccg 


4545 



gctccagatt 


tatcagcaat 


aaaccagcca 


gccggaaggg 


ccgagcgcag 


aagtggtcct 


4 oUb 


gcaactttat 


ccgcctccat 


ccagtctatt 


aattgttgcc 


gggaagctag agtaagtagt 


4 boo 


tcgccagtta 


atagtttgcg 


caacgttgtt 


gccattgcta 


caggcatcgt 


qqtqtcacqc 


4 / Zd 


tcgtcgtttg 


gtatggcttc 


attcagctcc 


ggttcccaac 


qatcaaqqcq 


agttacatga 


A *7 O C 

4785 


tcccccatgt 


tgtgcaaaaa 


agcggttagc 


tccttcggtc 


ctccgatcgt 


tgtcagaagt 


4845 


aagttggccg 


cagtgttatc 


actcatggtt 


atggcagcac 


tgcataatt c 


tcttactgt c 


4 905 


atgccatccg 


taagatgctt 


ttctgtgact 


ggtgagtact 


caaccaagtc 


attctgagaa 


4965 


tagtgtatgc 


ggcgaccgag 


ttgctcttgc 


ccggcgtcaa 


tacgggataa 


taccgcgcca 


5025 


catagcagaa 


ctttaaaagt 


gctcatcatt 


ggaaaacgtt 


cttcggggcg 


aaaactctca 


5085 


aggatcttac 


cgctgttgag 


atccagttcg 


atgtaaccca 


ctcgtgcacc 


caactgatct 


5145 


tcagcatctt 


ttactttcac 


cagcgtttct 


gggtgagcaa 


aaacaggaag 


gcaaaatgcc 


5205 


gcaaaaaagg 


gaataagggc 


gacacggaaa 


tgttgaatac 


tcatactctt 


cctttttcaa 


5265 


tattattgaa 


gcatttatca 


gggttattgt 


ctcatgagcg 


gatacatatt 


tgaatgtatt 


5325 


tagaaaaata 


aacaaatagg 


ggttccgcgc 


acatttcccc 


gaaaagtgcc 


acctgacgtc 


5385 


taagaaacca 


ttattatcat 


gacattaacc 


tataaaaata 


ggcgtatcac gaggcccttt 


5445 


cgtcttcac 












5454 



<210> 2 

<211> 5448 

<212> DNA 

<213> Escherichia coli 



<220> 

<223> Expression plasmid pStreptavidin-GBP for His6-streptavidin-GBP . 

PQE-80L vector (Qiagen) . CDS core streptavidin ( 169-531 ) , (Chilkoti 
et al., Proc Natl Acad Sci USA 92:1754-1758, 1995). CDS GBP 
(559-852), pSB3053, (Brown, Nat. Biotechnol. 15:269-272, 1997). 



<220> 

<221> CDS 

<222> (115) . . (879) 

<223> CDS for fusion protein His6-streptavidin-GBP consisting of 255 
amino acid residues. 

<400> 2 

ctcgagaaat cataaaaaat ttatttgctt tgtgagcgga taacaattat aatagattca 60 

5 



attgtgagcg gataacaatt tcacacagaa ttcattaaag aggagaaatt aact atg 117 

Met 
1 

aga gga teg cat cac cat cac cat cac gga tec ggt tct ggt ggc cat 165 
Arg Gly Ser His His His His His His Gly Ser Gly Ser Gly Gly His 
5 10 15 

atg get gaa get ggt ate ace ggc ace tgg tac aac cag ctg gga tec 213 
Met Ala Glu Ala Gly lie Thr Gly Thr Trp Tyr Asn Gin Leu Gly Ser 

20 25 30 

acc ttc ate gtt acc get ggt get gac ggt get ctg ace ggt ace tac 261 
Thr Phe He Val Thr Ala Gly Ala Asp Gly Ala Leu Thr Gly Thr Tyr 

35 40 45 

gaa tec get gtt ggt aac get gaa tct aga tac gtt ctg acc ggt cgt 309 
Glu Ser Ala Val Gly Asn Ala Glu Ser Arg Tyr Val Leu Thr Gly Arg 

50 55 60 65 

tac gac tec get ccg get acc gac ggt tec gga acc get ctg ggt tgg 357 
Tyr Asp Ser Ala Pro Ala Thr Asp Gly Ser Gly Thr Ala Leu Gly Trp 

70 75 80 

acc gtt get tgg aaa aac aac tac cgt aac get cac tec get acc acc 4 05 

Thr Val Ala Trp Lys Asn Asn Tyr Arg Asn Ala His Ser Ala Thr Thr 

85 " 90 95 

tgg tct ggc cag tac gtt ggt ggt get gaa get cgt ate aac acc cag 453 
Trp Ser Gly Gin Tyr Val Gly Gly Ala Glu Ala Arg He Asn Thr Gin 
100 105 110 

tgg ttg ttg acc tec ggc acc acc gaa get aac gcg tgg aaa tec acc 501 
Trp Leu Leu Thr Ser Gly Thr Thr Glu Ala Asn Ala Trp Lys Ser Thr 
115 120 125 

ctg gtt ggt cac gac acc ttc acc aaa gtt teg age tct ggt agt ggc 54 9 

Leu Val Gly His Asp Thr Phe Thr Lys Val Ser Ser Ser Gly Ser Gly 
130 135 140 145 

aat ggt cat atg cat gga aaa act cag gca acc age ggg act ate cag 597 
Asn Gly His Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin 

150 155 160 

age atg cat gga aaa act cag gca acc age ggg act ate cag age atg 64 5 

Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met 

165 170 175 

cat gga aaa act cag gca acc age ggg act ate cag age atg cat gga 693 
His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly 
180 185 190 

aaa act cag gca acc age ggg act ate cag age atg cat gga aaa act 741 
Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys Thr 
195 200 205 

cag gca acc age ggg act ate cag age atg cat gga aaa act cag gca 789 



Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys Thr Gin Ala 

210 215 220 225 

acc age ggg act ate cag age atg cat gga aaa att cag gca acc age 837 

Thr Ser Gly Thr He Gin Ser Met His Gly Lys He Gin Ala Thr Ser 

230 235 240 



ggg act ate cag age atg cat get ctg tec etc gag ggt ccg 879 
Gly Thr He Gin Ser Met His Ala Leu Ser Leu Glu Gly Pro 
245 250 255 



taataagctt 


aattagctga 


gcttggactc 


ctgttgatag 


atccagtaat 


gacctcagaa 


939 


ctccatctgg 


atttgttcag 


aacgeteggt 


tgccgccggg 


cgttttttat 


tggtgagaat 


999 


ccaagctagc 


ttggcgagat 


tttcaggagc 


taaggaagct 


aaaatggaga 


aaaaaatcac 


1059 


tggatatacc 


acegttgata 


tatcccaatg 


geategtaaa 


gaacattttg 


aggcatttca 


1119 


gtcagttgct 


caatgtacct 


ataaccagac 


cgttcagctg 


gatattaegg 


cctttttaaa 


1179 


gaeegtaaag 


aaaaataagc 


acaagtttta 


tccggccttt 


attcacattc 


ttgcccgcct 


1239 


gatgaatget 


catceggaat 


ttcgtatggc 


aatgaaagac 


ggtgagctgg 


tgatatggga 


1299 


tagtgttcac 


ccttgttaca 


ccgttttcca 


tgagcaaact 


gaaacgtttt 


catcgctctg 


1359 


gagtgaatac 


cacgacgatt 


teeggcagtt 


tctacacata 


tattegcaag 


atgtggcgtg 


1419 


ttacggtgaa 


aacctggcct 


atttccctaa 


agggtttatt 


gagaatatgt 


ttttegtetc 


1479 


agccaatccc 


tgggtgagtt 


tcaccagttt 


tgatttaaac 


gtggccaata 


tggacaactt 


1539 


cttcgccccc 


gttttcacca 


tgggcaaata 


ttatacgcaa 


ggegacaagg 


tgetgatgee 


1599 


getggegatt 


caggttcatc 


atgccgtttg 


tgatggcttc 


catgteggea 


gaatgcttaa 


1659 


tgaattacaa 


cagtactgeg 


atgagtggca 


gggcggggcg 


taattttttt 


aaggcagtta 


1719 


ttggtgccct 


taaacgcctg 


gggtaatgac 


tctctagctt 


gaggcatcaa 


ataaaacgaa 


1779 


aggctcagtc 


gaaagactgg 


gectttegtt 


ttatctgttg 


tttgtcggtg 


aacgctctcc 


1839 


tgagtaggac 


aaatccgccc 


tctagattac 


gtgeagtega 


tgataagctg 


tcaaacatga 


1899 


gaattgtgcc 


taatgagtga 


gctaacttac 


attaattgeg 


ttgcgctcac 


tgcccgcttt 


1959 


ecagteggga 


aacctgtcgt 


gccagctgca 


ttaatgaatc 


ggccaacgcg 


eggggagagg 


2019 


cggtttgcgt 


attgggegee 


agggtggttt 


ttcttttcac 


cagtgagacg 


ggcaacagct 


2079 


gattgecett 


caccgcctgg 


ccctgagaga 


gttgcagcaa 


gcggtccacg 


ctggtttgcc 


2139 


ccagcaggcg 


aaaatcctgt 


ttgatggtgg 


ttaaeggegg 


gatataacat 


gagctgtctt 


2199 


eggtategtc 


gtatcccact 


accgagatat 


ccgcaccaac 


gcgcagcccg 


gaeteggtaa 


2259 



1 



tggcgcgcat 


tgcgcccagc 


gccatctgat 


cgttggcaac 


cagcatcgca 


gtgggaacga 


2319 


tgccctcatt 


cagcatttgc 


atggtttgtt 


gaaaaccgga 


catggcactc 


cagtcgcctt 


2379 


cccgttccgc 


tatcggctga 


atttgattgc 


gagtgagata 


tttatgccag 


ccagccagac 


2439 


gcagacgcgc 


cgagacagaa 


cttaatgggc 


ccgctaacag 


cgcgatttgc 


tggtgaccca 


2499 


atgcgaccag 


atgctccacg 


cccagtcgcg 


taccgtcttc 


atgggagaaa 


ataatactgt 


2559 


tgatgggtgt 


ctggtcagag 


acatcaagaa 


ataacgccgg 


aacattagtg 


caggcagctt 


2619 


ccacagcaat 


ggcatcctgg 


tcatccagcg 


gatagttaat 


gatcagccca 


ctgacgcgtt 


2679 


gcgcgagaag 


attgtgcacc 


gccgctttac 


aggcttcgac 


gccgcttcgt 


tctaccatcg 


2739 


acaccaccac 


gctggcaccc 


agttgatcgg 


cgcgagattt 


aatcgccgcg 


acaatttgeg 


2799 


acggcgcgtg 


cagggccaga 


ctggaggtgg 


caacgccaat 


cagcaacgac 


tgtttgcccg 


2859 


ccagttgttg 


tgccacgcgg 


ttgggaatgt 


aattcagctc 


cgccatcgcc 


gcttccactt 


2919 


tttcccgcgt 


tttcgcagaa 


acgtggctgg 


cctggttcac 


cacgegggaa 


aeggtctgat 


2979 


aagagacacc 


ggcatactct 


gcgacatcgt 


ataacgttac 


tggtttcaca 


ttcaccaccc 


3039 


tgaattgact 


ctcttccggg 


cgctatcatg 


ccataccgcg 


aaaggttttg 


caccattcga 


3099 


tggtgtcgga 


atttcgggca 


gcgttgggtc 


ctggccacgg 


gtgcgcatga 


tctagagctg 


3159 


cctcgcgcgt 


ttcggtgatg 


acggtgaaaa 


cctctgacac 


atgcagctcc 


eggagaeggt 


3219 


cacagcttgt 


ctgtaagcgg 


atgccgggag 


cagacaagcc 


cgtcagggcg 


cgtcagcggg 


3279 


tgttggcggg 


tgtcggggcg 


cagccatgac 


ccagtcacgt 


agegatageg 


gagtgtatac 


3339 


tggcttaact 


atgcggcatc 


agagcagatt 


gtactgagag 


tgcaccatat 


gcggtgtgaa 


3399 


ataccgcaca 


gatgcgtaag 


gagaaaatac 


cgcatcaggc 


gctcttccgc 


ttcctcgctc 


3459 


actgactcgc 


tgcgctcggt 


cgttcggctg 


cggcgagcgg 


tatcagctca 


etcaaaggeg 


3519 


gtaatacggt 


tatccacaga 


atcaggggat 


aacgcaggaa 


agaacatgtg 


agcaaaaggc 


3579 


cagcaaaagg 


ccaggaaccg 


taaaaaggcc 


gcgttgctgg 


cgtttttcca 


taggctccgc 


3639 


ccccctgacg 


agcatcacaa 


aaatcgacgc 


tcaagtcaga 


ggtggcgaaa 


cccgacagga 


3699 


ctataaagat 


accaggcgtt 


tccccctgga 


agctccctcg 


tgcgctctcc 


tgttccgacc 


3759 


ctgccgctt a 


ccggat acct 


gt ccgcctt t 


ct ccctt egg 


gaagcgt ggc 


get ttctcat 


3819 


agctcacgct 


gtaggtatct 


cagttcggtg 


taggtegtte 


gctccaagct 


gggctgtgtg 


3879 


cacgaacccc 


ccgttcagcc 


cgaccgctgc 


gccttatccg 


gtaactatcg 


tcttgagtcc 


3939 


aacccggtaa 


gacacgactt 


atcgccactg 


gcagcagcca 


ctggtaacag 


gattagcaga 


3999 



f 



gcgaggtatg 


taggcggtgc 


tacagagttc 


ttgaagtggt 


ggcctaacta 


cggctacact 


4059 


agaaggacag 


tatttggtat 


ctgcgctctg 


ctgaagccag 


ttaccttcgg 


aaaaagagtt 


4119 


ggtagctctt 


gatccggcaa 


acaaaccacc 


gctggtagcg gtggtttttt 


tgtttgcaag 


4179 


cagcagatta 


cgcgcagaaa 


aaaaggatct 


caaaaaaatc 


ctttgatctt 


ttctacgggg 


4239 


tctgacgctc 


agtggaacga 


aaactcacgt 


i~ ri a crcrna i~ t" t" 

a. v_-i y \j a. i~ v- l. 


tggtcatgag 


attatcaaaa 


4299 


aggatcttca 


cctagatcct 


tttaaattaa 


CICICL UyGGy l_ L. 


ttaaatcaat 


ctaaagtata 


4359 


tatgagtaaa 


cttggtctga 


cagttaccaa 




gtgaggcacc tatctcagcg 


4419 


atctgtctat 


ttcgttcatc 


catagttgcc 


Lvj GL- <— ^ 


tcgtgtagat 


aactacgata 


4479 


cgggagggct 


taccatctgg 


ccccagtgct 


gcaatgatac 


cgcgagaccc 


acgctcaccg 


4539 


gctccagatt 


tatcagcaat 


aaaccagcca 


gccggaaggg 


ccgagcgcag 


aagtggtcct 


4599 


gcaactttat 


ccgcctccat 


ccagtctatt 


aattgttgcc 


gggaagctag 


agtaagtagt 


4659 


tcgccagtta 


atagtttgcg 


caacgttgtt 


gccattgcta 


caggcatcgt 


ggtgtcacgc 


4719 


tcgtcgtttg 


gtatggcttc 


attcagctcc 


ggttcccaac 


gatcaaggcg agttacatga 


4779 


tcccccatgt 


tgtgcaaaaa 


agcggttagc 


tccttcggtc 


ctccgatcgt 


tgtcagaagt 


4839 


aagttggccg 


cagtgttatc 


actcatggtt 


atggcagcac tgcataattc tcttactgtc 


4899 


atgccatccg 


taagatgctt 


ttctgtgact 


ggtgagtact 


caaccaagtc 


attctgagaa 


4959 


tagtgtatgc 


ggcgaccgag 


ttgctcttgc 


ccggcgtcaa 


tacgggataa 


taccgcgcca 


5019 


catagcagaa 


ctttaaaagt 


gctcatcatt 


ggaaaacgtt 


cttcggggcg 


aaaactctca 


5079 


aggatcttac 


cgctgttgag 


atccagttcg 


atgtaaccca 


ctcgtgcacc 


caactgatct 


5139 


tcagcatctt 


ttactttcac 


cagcgtttct 


gggtgagcaa 


aaacaggaag 


gcaaaatgcc 


5199 


gcaaaaaagg 


gaataagggc 


gacacggaaa 


tgttgaatac tcatactctt 


cctttttcaa 


5259 


tattattgaa 


gcatttatca 


gggttattgt 


ctcatgagcg gatacatatt 


tgaatgtatt 


5319 


tagaaaaata 


aacaaatagg 


ggttccgcgc 


acatttcccc 


gaaaagtgcc 


acctgacgtc 


5379 


taagaaacca 


ttattatcat 


gacattaacc 


tataaaaata 


ggcgtatcac 


gaggcccttt 


5439 


cgtcttcac 












5448 



<210> 3 
<211> 1182 
<212> DNA 



<213> 



Escherichia coli 



<220> 

<223> Sequence shows CDS portion (115-1293) of the expression plasmid, 
pPA-GBP-PA for fusion protein His6-protein A-GBP-protein A. The 
total length of the plasmid is 5859 base pairs. The origin of the 
basic vector and other CDSs is the same as pPA-GBP. 



<220> 

<221> CDS 

<222> (1)..(1179) 

<223> CDS for fusion protein His6-protein A-GBP-protein A consisting of 
393 amino acid residues. 

<400> 3 

atg aga gga teg cat cac cat cac cat cac gga tec ggt tct ggt gcg 4 8 

Met Arg Gly Ser His His His His His His Gly Ser Gly Ser Gly Ala 
15 10 15 

caa cac gat gaa gec gta gac aac aaa ttc aac aaa gaa caa caa aac 96 
Gin His Asp Glu Ala Val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn 
20 25 30 

gcg ttc tat gag ate tta cat tta cct aac tta aac gaa gaa caa cga 14 4 

Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg 
35 40 45 

aac gec ttc ate caa agt tta aaa gat gac cca age caa age get aac 192 
Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn 
50 55 60 

ctt tta gca gaa get aaa aag eta aat gat get cag gcg ccg aaa gta 240 
Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Val 
65 70 75 80 

gac aac aaa ttc aac aaa gaa caa caa aac gcg ttc tat gag ate tta 288 
Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu 
85 90 95 

cat tta cct aac tta aac gaa gaa caa cga aac gee ttc ate caa agt 336 
His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser 
100 105 110 

tta aaa gat gac cca age caa age get aac ctt tta gca gaa get aaa 384 
Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys 
115 120 125 

aag eta aat gat get cag gcg ccg aaa gta gac gcg aat teg age tct 4 32 

Lys Leu Asn Asp Ala Gin Ala Pro Lys Val Asp Ala Asn Ser Ser Ser 
130 135 140 

ggt agt ggc aat ggt cat atg cat gga aaa act cag gca ace age ggg 4 80 

Gly Ser Gly Asn Gly His Met His Gly Lys Thr Gin Ala Thr Ser Gly 
145 150 155 160 



to 



act ate cag age atg cat gga aaa act cag gca acc age ggg act ate 528 
Thr He Gin Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He 
165 170 175 

cag age atg cat gga aaa act cag gca acc age ggg act ate cag age 576 
Gin Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser 
180 185 190 

atg cat gga aaa act cag gca acc age ggg act ate cag age atg cat 624 
Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His 

195 200 205 

gga aaa act cag gca acc age ggg act ate cag age atg cat gga aaa 672 
Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys 
210 215 220 

act cag gca acc age ggg act ate cag age atg cat gga aaa att cag 720 
Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys He Gin 

225 230 235 240 

gca acc age ggg act ate cag age atg cat get ctg tec etc gag ggt 768 
Ala Thr Ser Gly Thr He Gin Ser Met His Ala Leu Ser Leu Glu Gly 

245 250 255 

ggc gga tec ggt tct ggt gcg caa cac gat gaa gee gta gac aac aaa 816 
Gly Gly Ser Gly Ser Gly Ala Gin His Asp Glu Ala Val Asp Asn Lys 

260 265 270 

ttc aac aaa gaa caa caa aac gcg ttc tat gag ate tta cat tta cct 8 64 

Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu He Leu His Leu Pro 

275 280 285 

aac tta aac gaa gaa caa cga aac gee ttc ate caa agt tta aaa gat 912 
Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe He Gin Ser Leu Lys Asp 

290 295 300 

gac cca age caa age get aac ctt tta gca gaa get aaa aag eta aat 960 
Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn 
305 310 315 320 

gat get cag gcg ccg aaa gta gac aac aaa ttc aac aaa gaa caa caa 1008 
Asp Ala Gin Ala Pro Lys Val Asp Asn Lys Phe Asn Lys Glu Gin Gin 

325 330 335 

aac gcg ttc tat gag ate tta cat tta cct aac tta aac gaa gaa caa 1056 
Asn Ala Phe Tyr Glu He Leu His Leu Pro Asn Leu Asn Glu Glu Gin 

340 345 350 

cga aac gee ttc ate caa agt tta aaa gat gac cca age caa age get 1104 
Arg Asn Ala Phe He Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala 
355 360 365 

aac ctt tta gca gaa get aaa aag eta aat gat get cag gcg ccg aaa 1152 
Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys 

370 375 380 

gta gac gcg aat teg age tct ggt ggc taa 1182 



(I 



Val Asp Ala Asn Ser Ser Ser Gly Gly 
385 390 



<210> 4 

<211> 1170 

<212> DNA 

<213> Escherichia coli 



<220> 

<223> CDS portion (115-1281) of the expression plasmid, 

pStrept-GBP-Strept (total length 5833 base pairs) for fusion 
protein His6-streptavidin-GBP-streptavidin. Vector and other CDSs 
are the same as pPA-GBP and pStreptavidin-GBP. 



<220> 

<221> CDS 

<222> (1)..(1167) 

<223> CDS for fusion protein His6-streptavidin-GBP-streptavidin 
consisting of 389 amino acid residues. 

<400> 4 

atg aga gga teg cat cac cat cac cat cac gga tec ggt tct ggt ggc 4 8 

Met Arg Gly Ser His His His His His His Gly Ser Gly Ser Gly Gly 
15 10 15 

cat atg get gaa get ggt ate acc ggc ace tgg tac aac cag ctg gga 96 
His Met Ala Glu Ala Gly lie Thr Gly Thr Trp Tyr Asn Gin Leu Gly 
20 25 30 

tec acc ttc ate gtt acc get ggt get gac ggt get ctg acc ggt acc 14 4 

Ser Thr Phe He Val Thr Ala Gly Ala Asp Gly Ala Leu Thr Gly Thr 
35 40 45 

tac gaa tec get gtt ggt aac get gaa tct aga tac gtt ctg acc ggt 192 
Tyr Glu Ser Ala Val Gly Asn Ala Glu Ser Arg Tyr Val Leu Thr Gly 
50 55 60 

cgt tac gac tec get ccg get acc gac ggt tec gga acc get ctg ggt 24 0 

Arg Tyr Asp Ser Ala Pro Ala Thr Asp Gly Ser Gly Thr Ala Leu Gly 
65 70 75 80 

tgg acc gtt get tgg aaa aac aac tac cgt aac get cac tec get acc 288 
Trp Thr Val Ala Trp Lys Asn Asn Tyr Arg Asn Ala His Ser Ala Thr 
85 ^ 90 95 

acc tgg tct ggc cag tac gtt ggt ggt get gaa get cgt ate aac acc 336 
Thr Trp Ser Gly Gin Tyr Val Gly Gly Ala Glu Ala Arg He Asn Thr 
100 105 110 

cag tgg ttg ttg acc tec ggc acc acc gaa get aac gcg tgg aaa tec 384 
Gin Trp Leu Leu Thr Ser Gly Thr Thr Glu Ala Asn Ala Trp Lys Ser 
115 120 125 



12- 



acc ctg gtt ggt cac gac acc ttc acc aaa gtt teg age tct ggt agt 432 
Thr Leu Val Gly His Asp Thr Phe Thr Lys Val Ser Ser Ser Gly Ser 
130 135 140 

ggc aat ggt cat atg cat gga aaa act cag gca acc age ggg act ate 480 
Gly Asn Gly His Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He 
145 150 155 160 

cag age atg cat gga aaa act cag gca acc age ggg act ate cag age 528 
Gin Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser 

165 ~ 170 175 

atg cat gga aaa act cag gca acc age ggg act ate cag age atg cat 57 6 

Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His 
180 185 190 

gga aaa act cag gca acc age ggg act ate cag age atg cat gga aaa 624 
Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys 
195 200 205 

act cag gca acc age ggg act ate cag age atg cat gga aaa act cag 672 
Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys Thr Gin 
210 215 220 

gca acc age ggg act ate cag age atg cat gga aaa att cag gca acc 720 
Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys He Gin Ala Thr 

225 230 235 240 

age ggg act ate cag age atg cat get ctg tec etc gag gga tct ggt 768 
Ser Gly Thr He Gin Ser Met His Ala Leu Ser Leu Glu Gly Ser Gly 

245 250 255 

tct ggt ggc cat atg get gaa get ggt ate acc ggc acc tgg tac aac 816 
Ser Gly Gly His Met Ala Glu Ala Gly He Thr Gly Thr Trp Tyr Asn 

260 265 270 

cag ctg gga tec acc ttc ate gtt acc get ggt get gac ggt get ctg 864 
Gin Leu Gly Ser Thr Phe He Val Thr Ala Gly Ala Asp Gly Ala Leu 

275 280 285 

acc ggt acc tac gaa tec get gtt ggt aac get gaa tct aga tac gtt 912 
Thr Gly Thr Tyr Glu Ser Ala Val Gly Asn Ala Glu Ser Arg Tyr Val 
290 295 300 

ctg acc ggt cgt tac gac tec get ccg get acc gac ggt tec gga acc 960 
Leu Thr Gly Arg Tyr Asp Ser Ala Pro Ala Thr Asp Gly Ser Gly Thr 
305 ' 310 315 320 

get ctg ggt tgg acc gtt get tgg aaa aac aac tac cgt aac get cac 1008 
Ala Leu Gly Trp Thr Val Ala Trp Lys Asn Asn Tyr Arg Asn Ala His 

325 330 335 

tec get acc acc tgg tct ggc cag tac gtt ggt ggt get gaa get cgt 1056 
Ser Ala Thr Thr Trp Ser Gly Gin Tyr Val Gly Gly Ala Glu Ala Arg 

340 345 350 

ate aac acc cag tgg ttg ttg acc tec ggc acc acc gaa get aac gcg 1104 



lie Asn Thr Gin Trp Leu Leu Thr Ser Gly Thr Thr Glu Ala Asn Ala 
355 360 365 



tgg aaa tec acc ctg gtt ggt cac gac acc ttc acc aaa gtt teg age 1152 

Trp Lys Ser Thr Leu Val Gly His Asp Thr Phe Thr Lys Val Ser Ser 
370 375 380 

tea age tta att age tga 1170 

Ser Ser Leu lie Ser 

385 



<210> 5 

<211> 1176 

<212> DNA 

<213> Escherichia coli 



<220> 

<223> CDS portion (115-1287) of the expression plasmid, 

pPA-GBP-Streptavidin (total length 5839 base pairs) for fusion 
protein His6-PA-GBP-streptavidin. Vector and other CDSs are the 
same as pPA-GBP and pStreptavidin-GBP. 



<220> 

<221> CDS 

<222> (1)..(1173) 

<223> CDS for fusion protein His6-PA-GBP-streptavidin consisting of 391 
amino acid residues. 

<400> 5 

atg aga gga teg cat cac cat cac cat cac gga tec ggt tct ggt gcg 4 8 

Met Arg Gly Ser His His His His His His Gly Ser Gly Ser Gly Ala 
15 10 15 

caa cac gat gaa gee gta gac aac aaa ttc aac aaa gaa caa caa aac 96 
Gin His Asp Glu Ala Val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn 
20 25 30 

gcg ttc tat gag ate tta cat tta cct aac tta aac gaa gaa caa cga 144 
Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg 
35 40 45 

aac gee ttc ate caa agt tta aaa gat gac cca age caa age get aac 192 
Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn 
50 55 60 



ctt tta gca gaa get aaa aagcta aat gat get cag gcg ccg aaa gta 240 
Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Val 
65 70 75 80 

gac aac aaa ttc aac aaa gaa caa caa aac gcg ttc tat gag ate tta 288 
Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu 
85 90 95 



it 



cat tta cct aac tta aac gaa gaa caa cga aac gcc ttc ate caa agt 336 
His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser 
100 105 110 

tta aaa gat gac cca age caa age get aac ctt tta gca gaa get aaa 384 
Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys 
115 120 125 

aag eta aat gat get cag gcg ccg aaa gta gac gcg aat teg age tct 4 32 

Lys Leu Asn Asp Ala Gin Ala Pro Lys Val Asp Ala Asn Ser Ser Ser 

130 ' 135 140 

ggt agt ggc aat ggt cat atg cat gga aaa act cag gca ace age ggg 4 80 

Gly Ser Gly Asn Gly His Met His Gly Lys Thr Gin Ala Thr Ser Gly 
145 150 155 160 

act ate cag age atg cat gga aaa act cag gca ace age ggg act ate 528 
Thr He Gin Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He 

165 170 175 

cag age atg cat gga aaa act cag gca ace age ggg act ate cag age 57 6 

Gin Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser 
180 185 190 

atg cat gga aaa act cag gca ace age ggg act ate cag age atg cat 624 
Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His 
195 200 205 

gga aaa act cag gca ace age ggg act ate cag age atg cat gga aaa 672 
Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys 
210 215 220 

act cag gca ace age ggg act ate cag age atg cat gga aaa att cag 720 
Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys He Gin 

225 230 235 240 

gca acc age ggg act ate cag age atg cat get ctg tec etc gag gga 7 68 

Ala Thr Ser Gly Thr He Gin Ser Met His Ala Leu Ser Leu Glu Gly 

245 250 255 

tct ggt tct ggt ggc cat atg get gaa get ggt ate acc ggc acc tgg 816 
Ser Gly Ser Gly Gly His Met Ala Glu Ala Gly He Thr Gly Thr Trp 

260 265 270 

tac aac cag ctg gga tec acc ttc ate gtt acc get ggt get gac ggt 8 64 

Tyr Asn Gin Leu Gly Ser Thr Phe He Val Thr Ala Gly Ala Asp Gly 
275 280 285 

get ctg acc ggt acc tac gaa tec get gtt ggt aac get gaa tct aga 912 
Ala Leu Thr Gly Thr Tyr Glu Ser Ala Val Gly Asn Ala Glu Ser Arg 

290 295 300 

tac gtt ctg acc ggt cgt tac gac tec get ccg get acc gac ggt tec 960 
Tyr Val Leu Thr Gly Arg Tyr Asp Ser Ala Pro Ala Thr Asp Gly Ser 
305 310 315 320 



gga acc get ctg ggt tgg ace gtt get tgg aaa aac aac tac cgt aac 1008 
Gly Thr Ala Leu Gly Trp Thr Val Ala Trp Lys Asn Asn Tyr Arg Asn 

325 330 335 

get cac tec get acc acc tgg tct ggc cag tac gtt ggt ggt get gaa 1056 
Ala His Ser Ala Thr Thr Trp Ser Gly Gin Tyr Val Gly Gly Ala Glu 
340 345 350 

get cgt ate aac acc cag tgg ttg ttg acc tec ggc acc acc gaa get 1104 
Ala Arg lie Asn Thr Gin Trp Leu Leu Thr Ser Gly Thr Thr Glu Ala 

355 360 365 

aac gcg tgg aaa tec acc ctg gtt ggt cac gac acc ttc acc aaa gtt 1152 
Asn Ala Trp Lys Ser Thr Leu Val Gly His Asp Thr Phe Thr Lys Val 
370 375 380 

teg age tea age tta att age tga 1176 
Ser Ser Ser Ser Leu lie Ser 
385 390 



<210> 6 

<211> 1176 

<212> DNA 

<213> Escherichia coli 



<220> 

<223> CDS portion (115-1287) of the expression plasmid, 

pStreptavidin-GBP-PA (total length 5853 base pairs) for fusion 
protein His6-streptavidin-GBP-PA. Vector and other CDSs are the 
same as pPA-GBP and pStreptavidin-GBP. 



<220> 

<221> CDS 

<222> (1)..(1173) 

<223> CDS for fusion protein His6-streptavidin-GBP-PA consisting of 391 
amino acid residues. 

<400> 6 

atg aga gga teg cat cac cat cac cat cac gga tec ggt tct ggt ggc 48 

Met Arg Gly Ser His His His His His His Gly Ser Gly Ser Gly Gly 
15 10 15 

cat atg get gaa get ggt ate acc ggc acc tgg tac aac cag ctg gga 96 
His Met Ala Glu Ala Gly He Thr Gly Thr Trp Tyr Asn Gin Leu Gly 

20 25 30 

tec acc ttc ate gtt acc get ggt get gac ggt get ctg acc ggt acc 144 
Ser Thr Phe He Val Thr Ala Gly Ala Asp Gly Ala Leu Thr Gly Thr 
35 40 45 

tac gaa tec get gtt ggt aac get gaa tct aga tac gtt ctg acc ggt 192 
Tyr Glu Ser Ala Val Gly Asn Ala Glu Ser Arg Tyr Val Leu Thr Gly 



lb 



50 55 60 

cgt tac gac tec get ccg get ace gac ggt tec gga ace get ctg ggt 240 
Arg Tyr Asp Ser Ala Pro Ala Thr Asp Gly Ser Gly Thr Ala Leu Gly 
65 70 75 80 

tgg ace gtt get tgg aaa aac aac tac cgt aac get cac tec get ace 288 
Trp Thr Val Ala Trp Lys Asn Asn Tyr Arg Asn Ala His Ser Ala Thr 
85 90 95 

ace tgg tct ggc cag tac gtt ggt ggt get gaa get cgt ate aac ace 336 
Thr Trp Ser Gly Gin Tyr Val Gly Gly Ala Glu Ala Arg He Asn Thr 
100 105 110 

cag tgg ttg ttg acc tec ggc ace ace gaa get aac gcg tgg aaa tec 384 
Gin Trp Leu Leu Thr Ser Gly Thr Thr Glu Ala Asn Ala Trp Lys Ser 
115 120 125 

acc ctg gtt ggt cac gac acc ttc acc aaa gtt teg age tct ggt agt 432 
Thr Leu Val Gly His Asp Thr Phe Thr Lys Val Ser Ser Ser Gly Ser 
130 135 140 

ggc aat ggt cat atg cat gga aaa act cag gca acc age ggg act ate 4 80 

Gly Asn Gly His Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He 
145 150 155 160 

cag age atg cat gga aaa act cag gca acc age ggg act ate cag age 528 
Gin Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser 
165 170 175 

atg cat gga aaa act cag gca acc age ggg act ate cag age atg cat 57 6 

Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His 
180 185 190 

gga aaa act cag gca acc age ggg act ate cag age atg cat gga aaa 624 
Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys 
195 200 205 

act cag gca acc age ggg act ate cag age atg cat gga aaa act cag 672 
Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys Thr Gin 
210 215 220 

gca acc age ggg act ate cag age atg cat gga aaa att cag gca acc 720 
Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys He Gin Ala Thr 
225 230 235 240 

age ggg act ate cag age atg cat get ctg tec etc gag ggt ggc gga 7 68 

Ser Gly Thr He Gin Ser Met His Ala Leu Ser Leu Glu Gly Gly Gly 
245 250 255 

tec ggt tct ggt gcg caa cac gat gaa gee gta gac aac aaa ttc aac 816 
Ser Gly Ser Gly Ala Gin His Asp Glu Ala Val Asp Asn Lys Phe Asn 
260 265 270 

aaa gaa caa caa aac gcg ttc tat gag ate tta cat ft a cct aac tta 8 64 

Lys Glu Gin Gin Asn Ala Phe Tyr Glu He Leu His Leu Pro Asn Leu 
275 280 285 



in 



aac gaa gaa caa cga aac gcc ttc ate caa agt tta aaa gat gac cca 912 
Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro 
290 295 300 

age caa age get aac ctt tta gca gaa get aaa aag eta aat gat get 960 
Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala 

305 310 315 320 

cag gcg ccg aaa gta gac aac aaa ttc aac aaa gaa caa caa aac gcg 1008 
Gin Ala Pro Lys Val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala 

325 330 335 

ttc tat gag ate tta cat tta cct aac tta aac gaa gaa caa cga aac 1056 
Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn 

340 345 350 

gcc ttc ate caa agt tta aaa gat gac cca age caa age get aac ctt 1104 
Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu 

355 " 360 365 

tta gca gaa get aaa aag eta aat gat get cag gcg ccg aaa gta gac 1152 
Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Val Asp 

370 375 380 

gcg aat teg age tct ggt ggc taa 1176 
Ala Asn Ser Ser Ser Gly Gly 

385 390 



<210> 7 

<211> 393 

<212> DNA 

<213> Escherichia coli 



<220> 

<223> CDS portion (115-504) of the expression plasmid, pGBP (total 

length 5073 base pairs) for fusion protein His6-GBP. Vector and 
other CDSs are the same as p PA-GBP. 



<220> 

<221> CDS 

<222> (1) . . (390) 

<223> CDS for fusion protein His6-GBP consisting of 130 amino acid 
residues . 

<400> 7 

atg aga gga teg cat cac cat cac cat cac gga tec gga ggt ggg age 48 

Met Arg Gly Ser His His His His His His Gly Ser Gly Gly Gly Ser 
15 10 15 

tct ggt agt ggc aat ggt cat atg cat gga aaa act cag gca ace age 96 
Ser Gly Ser Gly Asn Gly His Met His Gly Lys Thr Gin Ala Thr Ser 



20 



25 



30 



ggg act ate cag age atg cat gga aaa act cag gca acc age ggg act 14 4 

Gly Thr He Gin Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr 

35 40 45 

ate cag age atg cat gga aaa act cag gca acc age ggg act ate cag 192 

He Gin Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin 
50 55 60 

age atg cat gga aaa act cag gca acc age ggg act ate cag age atg 240 

Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met 

65 70 75 80 

cat gga aaa act cag gca acc age ggg act ate cag age atg cat gga 288 

His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly 
85 90 95 

aaa act cag gca acc age ggg act ate cag age atg cat gga aaa att 336 

Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys He 
100 105 110 

cag gca acc age ggg act ate cag age atg cat get ctg tec etc gag 384 

Gin Ala Thr Ser Gly Thr He Gin Ser Met His Ala Leu Ser Leu Glu 

115 ~ 120 125 



ggt ccg taa 393 
Gly Pro 
130 



<210> 8 

<211> 741 

<212> DNA 

<213> Escherichia coli 



<220> 

<223> CDS portion (115-852) of the expression plasmid, pGBP-GBP (total 
length 5421 base pairs) for fusion protein His6-GBP-GBP . Vector 
and other CDSs are the same as pPA-GBP. 



<220> 

<221> CDS 

<222> (1)..(738) 

<223> CDS for fusion protein His6-GBP-GBP consisting of 24 6 amino acid 
residues . 

<400> 8 

atg aga gga teg cat cac cat cac cat cac gga tec gga ggt ggg age 48 

Met Arg Gly Ser His His His His His His Gly Ser Gly Gly Gly Ser 
15 10 15 



tct ggt agt ggc aat ggt cat atg cat gga aaa act cag gca acc age 96 

Ser Gly Ser Gly Asn Gly His Met His Gly Lys Thr Gin Ala Thr Ser 

20 25 30 

ggg act ate cag age atg cat gga aaa act cag gca acc age ggg act 144 

Gly Thr He Gin Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr 

35 40 45 

ate cag age atg cat gga aaa act cag gca ace age ggg act ate cag 192 

He Gin Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin 

50 55 60 

age atg cat gga aaa act cag gca acc age ggg act ate cag age atg 240 

Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met 

65 70 75 80 

cat gga aaa act cag gca acc age ggg act ate cag age atg cat gga 288 

His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly 

85 90 95 

aaa act cag gca acc age ggg act ate cag age atg cat gga aaa att 336 

Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys He 

100 105 110 

cag gca acc age ggg act ate cag age atg cat get ctg tec etc gag 384 

Gin Ala Thr Ser Gly Thr He Gin Ser Met His Ala Leu Ser Leu Glu 

115 120 125 

ggt ggt gga age tct ggt agt ggc aat ggt cat atg cat gga aaa act 4 32 

Gly Gly Gly Ser Ser Gly Ser Gly Asn Gly His Met His Gly Lys Thr 
130 135 140 

cag gca acc age ggg act ate cag age atg cat gga aaa act cag gca 480 

Gin Ala Thr Ser Gly Thr He Gin Ser Met His Gly Lys Thr Gin Ala 

145 150 155 160 

acc age ggg act ate cag age atg cat gga aaa act cag gca acc age 528 

Thr Ser Gly Thr He Gin Ser Met His Gly Lys Thr Gin Ala Thr Ser 

165 170 175 

ggg act ate cag age atg cat gga aaa act cag gca acc age ggg act 576 

Gly Thr He Gin Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr 

180 185 190 

ate cag age atg cat gga aaa act cag gca acc age ggg act ate cag 624 

He Gin Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin 

195 200 205 

age atg cat gga aaa act cag gca acc age ggg act ate cag age atg 672 

Ser Met His Gly Lys Thr Gin Ala Thr Ser Gly Thr He Gin Ser Met 
210 215 220 

cat gga aaa att cag gca acc age ggg act ate cag age atg cat get 720 

His Gly Lys He Gin Ala Thr Ser Gly Thr He Gin Ser Met His Ala 

225 230 235 240 



ctg tec etc gag ggt ccg taa 



741 



Leu Ser Leu Glu Gly Pro 
245 



